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1. INTRODUCTION 

Among the grounds discussed in the field of information security is the cryptography and data hiding. 
Cryptography protects information by coding its content to become incomprehensible to unauthorized people. 
But, even an incomprehensible message may attract the attention of eavesdroppers. To overcome this hindrance, 
steganography offers the aspect of ” secrecy” rather than ”incomprehensibility”, it is the technique of concealing 
information through digital media. Its main objective is the secrecy of the information’s existence so that no one 
aside the authorized recipient may suspect it. In steganography, various media files have been utilized as cover 
file such as audio, image, video and plain text, but among these media, the most popular one to dissimulate 
secret information is image since it is shared everyday on networks and also because the human visual system 
can not detect slight changes done on the image intensity. 

Generally, there are two approaches of steganography: the spatial domain and the frequency domain. 
In the spatial domain approach, data is hidden directly in the pixels of the cover image, such as the Least 
Significant Bit (LSB) Substitution” [1], [2]. However, this technique is vulnerable to statistical attacks, to 
protect the hidden message, authors in [3] proposed to encode it using the Hungarian puzzle. There is also the 
interpolation based techniques [4], [5], [6] where data is hidden in the error between the initial pixels and the 
interpolated pixels, and the Pixel Value Differencing (PVD) [7], [8], [9] where data is hidden in the difference 
between each neighbouring pixels. In [10], authors proposed a steganographic algorithm based on Ant Colony 
Optimization (ACO), the ACO algorithm is used to select the complex regions of the cover image, then data is 
hidden using the LSB substitution in the selected areas. 

In the frequency domain based techniques, data is hidden in the coefficients of the transform domain, 
such as Discrete Cosinus Transform (DCT) and Discrete Wavelet Transform (DWT). In [11], a reversible data 
hiding for JPEG images is proposed, the quantization table is modified through dividing some of its elements 
by an integer while multiplying the corresponding quantized DCT coefficients by the same integer in order to 
create space for the dissimulation. In [12], authors proposed a steganographic scheme for JPEG that preserves 
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the DCT coefficients histogram in order to resist steganalysis based attacks. The scheme distinguishes sensitive 
pixels and protects them from the extra bit embedding to reduce the distortions in the histogram. In [13], 
the Integer Wavelet Transform (IWT) is applied to each 8x8 blocks of the cover image, then the zero tree 
method is utilized to select the proper location where data can be embedded. In [14], a steganographic scheme 
based on the Haar DWT is proposed, data is hidden in the first LSB of the DWT coefficients, the algorithm is 
generalized on K-LSB with the use of the Optima Pixel Adjustment OPA procedure in [15]. In [16], the edge 
IWT coefficients are classified based on their Most Significant bit (MSB), the size of data to be hidden in the 
coefficient is determined based on the value of the coefficient’s MSBs. 

In this paper, we propose a steganographic scheme based on the Faber Schauder DWT, this transform 
allows us to hide data in the integer part without worrying about the problem of the floating point (the pixels of 
the stego image are guaranteed to be integers). Data is hidden in the LSBs of the transform details. The message 
and the coefficients LSBs are decomposed to pairs of bits ™, and Z,, and based on the matrix that illustrates 
the difference of distance between m; and Z;, we search for the permutation that transforms the message into 
the binary sequence that has the most possible matches with Z,. The selection order of the coefficients where 
to dissimulate data is given aleatory by a random key. Experiments were performed on a large set of a variety 
of images to assess the perfomance of the proposed work, and comparison to prior works is accomplished. 
Results indicate good level of imperceptibility and trade-off capacity-imperceptibility. 

The remaining of the paper is organized as follows: Section 2 details the algorithms of decomposition 
and reconstruction of the Faber-Schauder DWT. In section 3, the proposed steganography method is explained. 
In section 4, experimental results of the test and comparison are discussed. Section 5 concludes the paper. 


2. FABER-SCHAUDER DWT 
The Faber-Schauder Wavelet transform is a multi-scale transform, the multi-scale analysis is formu- 
lated based on the study of compactly supported wavelet bases, it is the main theory in wavelets that analyzes 


in detail a signal in the frequency domain. Multi-Scale Analysis of L? (R) is a sequence of nested vector spaces 
(V5) jez (++ C Viz C Vja C V C Vj- Ca) 


For all j in Z, Vj41 C V;. Let Wj4ı be a supplementary of Vj+1 in V; : (V; = Vj41 ® Wj+1), the 
basis of Faber-Schauder is the basis of W;+1 given by the family of functions CAR ez, With: pi = gh, 441 
where: 


l l 1+t if —-1<t<0 
pi (t)=2%p(2%t—n), andy(t)={ 1-t if O<t<1 
0 if t é |—1,1] 


In two dimensions, L? (R?) is approximated by using the tensor product: 


Vj = Vj @ Vj = (Viy © Wy 41) @ (Vig © Wy41), where: Vj41 = Vj41 @ Vj4i, and 


Wy41 = (Vii 8 W541) + (W541 8 Vita) + (Wiji 8 W541) 


Thus, an image C is decomposed into four blocks: A, H, V and D as illustrated in Fig. 1. A corresponds 
to V;41, while H, V, and D correspond respectively to the three subspaces of W;+1. 
The decomposition algorithm of Faber-Schauder DWT is given by the following equations: 


Ali, j) = C(2i,27) 
H(i,j) = C(2i+1,23) zee eae) 
2i, 2j) + C(2i, 27 +2 
1 1 ? , 
ij C(2i + 2k, 27 + 2r) 
D(a, = C(2i+1,2j7+1)- 
(i, j) ( j+1) wy : 


The reconstruction algorithm or inverse Faber-Schauder Discrete Wavelet Transform is given by the 
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Figure 1. One level 2D Faber-Schauder DWT of the image Baboon 
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3. PROPOSED WORK 
3.1. Embedding Process 


After the dissimulation of the secret message, the cover image is modified, the principal objective of 
steganography is to minimize this modification so that the hiding does not arise the suspicion of eavesdroppers. 
The proposed method is based on Faber-Schauder DWT. The block A represents the approximation of the 
image cover, it contains the low frequencies where the human eye is sensitive to slightest modifications. A 
should remain unchanged. Therefore, data is hidden in the three remaining blocks H, V, and D. 

Let m be the binary sequence of the secret message m = {m1,..., ML} Mk € {0, 1}, and let Z be 
the set of the LSB of the integer part of the coefficients of the blocks H, V and D. We divide m and Z into 
pairs m = Um, and Z = |]J Zk, where Mk = {M2k—1, M2k } and Zk = {22k—1, Zan}. Vk My, and Zķ are in 
the set Æ = {{0,0}, {0,1}, {1,0}, {1, 1}}. 

When ™; is hidden into Z; there are three possibilities: they are identical m, = Zķ, conjugate 
Mk + Zk = {1,1} or they have one bit different. To visualize all these possibilities, we introduce the matrix 
G whose elements denote the number of times each pair ™; from E encounters a pair Zą from E. G is a 4x4 
matrix, because card(E) = 4, its elements are integer between 0 and L/2, G € My 4 ({0,...,£/2}). The 
first column is associated to {0,0}, the second column to {0, 1}, the third column to {1,0} and the last column 
to {1,1}. The same thing goes for the rows. If mye encounters Zg, then the element Gj, j from the 7 row and 
j column is incremented, where the relation between the pairs, rows and columns is given by the following 
function f which is a bijection that associates each pair to its decimal value plus 1: 
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j f(Mmok-1; Mək) = 2Mək-1 + Mək +1 
i = f(z2k-1, 22k) = 2zək-1 + Zor +1 


(2) 


In each column, the diagonal element show how much times my and Zę are identical, the element of 
the 2nd diagonal denotes the number of times when Mmg and Z, are opposite or conjugate. The remaining two 
elements denote how much times mą and Z; have on bit different. 

Example: 26 denotes how many times {0,1} of the message encounters {1,0} in the the coefficients 
LSBs. 


{0,0} {0,1} {1,0} {1,1} 
{0,0} 64 36 87 51 
G=| {0,1} 38 57 £472 9 
{1,0} 19 26 18 17 
{1,1} 29 16 87 21 


The error generated by the dissimulation is expressed by the following expression: 


1N 


1 M— —1 _ oes 
MSE = -7y > > (S(i, j) — C(i, j)) 


S 


Let H’(i, j) be the coefficient produced after hiding data in H (i, j), and h(i, j) = H' (i,j) — H(i, j) 
the difference coming from this dissimulation. We define v(i, j) and d(i, j) the same way. Therefore, using the 
reconstructions equations (1), the MSE becomes: 


1 M/2—1 N/2—1 
O -A2 - 2 E +\2 
MSE=7 > 2 h(i, j) + v(i, j) + d(i, j) 


The diagonal elements of the matrix G corresponds to when data is hidden with zero changes, the 
2nd diagonal elements corresponds to when 2 changes are needed, and the rest corresponds to when one bit is 
changed to hide 2 bits, as described in the following matrix Wg. 


2 
1 
We = 1 


2 0 


which we can reformulate as follows: 


4 4 4 


1 
MSE = MN > 9 Gij +) Gsis — tr(G)) 
S g=1 i=} 
4 
where tr(G) is the trace of the matrix G: tr(G) = 5 Gii 
i=1 
4 4 
Since N. y Gi j is unchanged for all permutations, then we define the function ? which associates 
i=1 j=1 


the remaining two terms to the permutation: 


0: 5 — Z 
4 


p laama 5 Gsaii == tr(G) 
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To minimize the MSE, we calculate (p) for all possible p in S4 (24 permutations). Then, we search 
for the permutation p* corresponding to the lowest value of (p). p* = min(0(p)). The equation p*(j) = j’ 
pesa 


signifies that the column j is permuted into 7’, which means that the pair mg associated to j by the function f 
introduced in (2) is consequently changed to the pair associated to 7’. 

For example, if p* (3) = 1, then by using the function f, f~+(3) is changed into f~'(1) i.e. each pair 
{1,0} in the secret message is changed into {0,0}. Hence, we obtain the transformation of the secret message 
m/’ that allows us to reach the lowest MSE calculated. m’ is given by: 


L/2 


m = U fto p* o f (ix) (3) 


k=1 


Example: We consider the message ” HELLO”, the binary sequence of this message is 
m = 0100100001100101011011000110110001101111 


We decompose m into pairs: m = 01|00|10|00|01|10|01|01|01|10|11|00|01|10|11|00|01|10|11|11. 
Suppose that the LSBs of the coefficients are : 


Z = 00|11|00/01|11|01|10|00|10|10|10]11|11|10|00|10|10|1000) 10. 


We construct the matrix G: 


OFF oO 
NUON 
Owenem 
Ou O M 


The error now is 25, which means that there is 25 among the 40 message bits that are going to be 
dissimulated into their opposite bits of the coefficients. Now, we calculate the errors of the 24 permutations 
and we choose the permutation p* associated to the lowest error. p* and its associated G* are given by 


s [/1234 px 
P ee 


The error becomes 15. Hence, we construct the new binary sequence m’ based on the equation (3) as 
follows: 


Conoco fl 
Ower 
O Ww orn 
verno 


{0,0} > {1,1} ; {0,1}— {1,0} ; {1,0}— {0,1} ; {1,1} — {1,1} 


3.2. Extraction process 


In the extraction, we retrieve the message m’ from the LSB of the coefficients’ integer part. To be able 
to obtain the actual message m, the permutation p* is needed. Thus, in the dissimulation phase, we hide an 
identifier in the first coefficients. p*(1), p* (2), p*(3) and p* (4) are hidden int the first eight coefficients. After 
the extraction of m’, we use the p* (i) to retrieve the message m as follows: 


m = ! ! 
where M'k = {M3 M3k41} 
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Embedding algorithm 
e Read the cover image as two dimensional file. 


Perform the Faber-Schauder DWT. 


Construct the matrix G, find the permutation p* and hide p* (1), p*(2), p*(3) and p* (4) in the first eight 
coefficients. 


Transform the binary sequence of the message m into m’ using p* and f and hide it in the coefficients 
starting from the seventeenth one. 


Apply the inverse Faber-Schauder discrete wavelet transform to obtain the stego image. 


Extraction algorithm 
e Read the stego image as two dimensional file. 
e Apply the Faber-Schauder DWT to the stego image. 


e Extract the permutation p* from the first eight coefficients and the identifier of the the key ø from the 
second eight coefficients. 


e Extract the binary sequence m’ from the coefficients, and reconstruct m using the function f and permu- 
tation p*. 


e Regroup the binary sequence m by blocks of 8 bits to obtain the hidden message. 


4. EXPERIMENTAL RESULTS AND DISCUSSION 

Experiments were accomplished to assess the performance of the proposed method using a variety of 
512x512 grayscale images of the SIPI database, containing some images which are frequently utilized in tests, 
like ”’ Baboon”, ”Peppers”, Lena” and ”Elaine” (see Fig. 2). 


Elaine 


ee Oe 


Peppers Boat Lena 


Figure 2. Some of the images used in the experiment 


The proposed work is compared with the methods developed by Amin [13], Miri [16] and Al-Dmour 
[17]. The test of the proposed work and the comparison are based on the following metrics [18]: 
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M-1N-1 
S(i,j) — Cli, j 
sa DD C69) 
PSNR = 10 Log (F) ; NAE = AIN 
CS) 
M-1N-1 E TY 
E Y 66H - C69) > NCUA - HeNSE,H) — us) 
IF =1-—*__ ; NCC = — 
2 d, O63" YC) = uA E SGI) - us)? 


The PSNR is the Peak Signal to Noise Ratio, it is calculated using the MSE. The more PSNR increases, 
the more the steganographic scheme is imperceptible. The NAE is the Normal Absolute Error, it measures the 
absolute value of the error between the cover and stego images. Small values of NAE (close to 0) are a sign of 
good imperceptibility. ZF is the Image Fidelity, the quantity 1 — I F measures the ratio of the energy of the error 
between the cover and stego images to the energy of the cover image. Obviously, good imperceptibility requires 
that 1 — IF is very close to 0, which means that JF has to be very close to 1. The Normalized Correlation 
Coefficient NCC is a scalar product of the normalized vectors vc and vg while vc is the cover image minus its 
mean value uç and vg is the stego image minus its mean value jug, so it takes values between —1 and 1. The 
closer NCC is to 1, the more similar are the images. If it is close to 0, the images are uncorrelated, and if it is 
close to —1, the images are said opposite. 


4.1. Test of the proposed method 


To test the proposed method, we used a set of 100 images with different modalities, downloaded from 
the SIPI image database. 


Table 1. Imperceptibility for the proposed method 


Metrics PSNR NAE IF NCC PSNR NAE IF NCC 
Data 3000 bytes 6000 bytes 

Min 61.55 2.0le-4 0.999973 0.999654 58.53 4.0le-4 0.999964 0.999371 
Max 62.31 1.38e-3 0.999999 0.999997 59.25 2.77e-3 0.999998 0.999994 
Mean 61.71 3.69e-4 0.999997 0.999948 58.68 7.42e-4 0.999994 0.999893 
Data 9000 bytes 12000 bytes 

Min 56.76 6.02e-4 0.999921 0.999073 55.52 8.05e-4 0.999893 0.998526 
Max 57.53 4.14e-3 0.999997 0.999987 56.25 5.5le-3 0.999996 0.999982 
Mean 56.92 1.1le-3 0.999991 0.999842 55.67 1.48e-3 0.999988 0.999778 
Data 18000 bytes 24000 bytes 

Min 53.75 1.2le-3 0.999841 0.998147 52.51 1.6le-3 0.999787 0.997565 
Max 54.49 8.29e-3 0.999994 0.999972 53.28 1.lle-2 0.999992 0.999962 
Mean 53.91 2.22e-3 0.999982 0.999671 52.65 2.96e-3 0.999976 0.999554 


Table 1 presents the results of the imperceptibility test for the proposed method, based on the metrics 
PSNR, NAE, IF and NCC. In this simulation, we dissimulated in the 100 test images a text of 3, 6, 9, 12, 18 
and 24 Kilo Bytes. The table gives the minimum, maximum and mean values. A steganography process is 
imperceptible when PSNR is beyond 36 dB. The PSNR values indicate a high level of imperceptibility, NAE 
values are very small, NAE < 10~?, and JF is practically 1, |1 — IF| < 1074. NCC values are very close to 1, 
|1 — NCC| < 1078, which proves that the cover and stego images are practically identical. Figure 3 exhibits 
the evolution of the PSNR for all the test images as the size of the hidden data increases. The PSNR diminishes, 
because when we hide larger data, the error becomes important. However, the drop of the imperceptibility 
becomes slower, when data size increases from 12 Kilo to 18 Kilo and from 18 Kilo to 24 Kilo, the mean PSNR 
decreases by 1.76 dB and 1.26 dB respectively. 
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Figure 3. Capacity-Imperceptibility 


4.2. Comparison to literature 


The proposed work is compared to the methods developed by Amin [13], Miri [16] and Al-Dmour 
[17]. The tests of the comparison respects the same conditions (images, size of hidden data) utilized in these 
works. 

Table 2 shows the results of comparison of PSNR to Amin’s work for the four images used in his 


3 
work, and table 3 compares the capacity of hiding. The proposed work provides a larger capacity -MN — 8 


bits, about 2.3 times the one of Amin, the subtracted 8 bits are reserved to hide the permutation p*. In the 
algorithm he proposed, Amin does not hide data in all the wavelet coefficients, he selects the location where 
to hide data via the zero tree method, hence the capacity is diminished. On another hand, even concerning 
the imperceptibility, the proposed work still has better results, for 100, 500 and 1K bytes, the difference is 
approximatively 1 dB. But, when we hide 5K, 10K and 15K bytes, the difference becomes 3 dB. 


Table 2. Comparison of PSNR to Amin [13] 


Image Method 100 500 1000 5000 10000 15000 
Amin 73.98 66.61 63.64 65.55 53.64 52.02 


Barbara proposed 76.57 69.46 66.38 59.37 56.35 54.59 
Peppers Amin 74.12 66.61 63.78 56.54 53.58 51.89 
Proposed 76.61 69.45 66.42 59.34 56.34 54.57 
Baboon Amin 75.62 68.18 62.89 56.06 53.32 51.75 
Proposed 76.59 69.40 66.38 59.35 56.32 54.56 
tena Amin 73.58 66.07 63.01 56.18 53.38 51.65 


Proposed 76.57 69.42 66.41 59.37 56.37 54.59 


Table 4 shows the results of the comparison of the PSNR to Miri [16] and Al-Dmour [17], we respected 
the size of hidden data used in [16]. The proposed work has higher values. For Miri, the difference is around 
3.6 dB. In fact, Miri may hide data in more than one bit on a wavelet coefficient depending on the weight 
(position) of the most significant bit, the greater is the position, the more bits of the coefficients are used to 
embed data, in this case, the error generated from the dissimulation increases, which affected his PSNR values. 
As for Al-Dmour [17], the difference starts with 3 dB, authors hide data in the edge coefficients and use the 
XOR cording in order to minimize the error of the dissimulation. However, as the size of data increased, more 
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Table 3. Comparison of the hiding capacity to Amin [13] 


Image Size Amin [13] Proposed 
Lena 128x128 5145 11891 
Lena 256x256 20622 48371 
Lena 512x512 82578 195059 
Peppers 128x128 5223 11891 
Peppers 256x256 20694 48371 
Peppers 512x512 83846 195059 


Table 4. Comparison of PSNR to Miri [16] and Al-Dmour [17]. 


Data size Al-Dmour Miri Proposed 
6300 bits 64.76 63.80 67.44 
12800 bits 61.50 60.66 64.32 
28800 bits 56.91 56.79 60.78 
51200 bits 52.62 54.78 58.28 
67700 bits 50.28 53.68 57.06 


bits of the edge coefficients are used to dissimulate data (and depending of the cover image complexity, more 
bits of the coefficient may be used), which decreases significantly the PSNR. Hence, the difference enlarged to 
about 7 dB since in our case, we use only one bit in each coefficient, and the optimal permutation p* transforms 
the message into the best match for the cover image. 


5. CONCLUSION 

In this paper, a steganographic method based on Faber-Schauder DWT is proposed. Data is divided 
into pairs of 2 bits, the same is done to the LSB of the details in the transform domain. We establish a matrix 
that calculates the number of times where data and the coefficients are similar or opposite, and based on this 
matrix we find the permutation that transforms the message into the binary sequence that provides the most 
match possible to the coefficients LSBs. Results showed good trade-off between capacity and imperceptibility, 
and higher values in both of them compared to existing methods. In our future works, we will study more how 
to minimize the error generated by the dissimulation and we will strengthen the security through the analysis 
of the hiding’s effect on the histogram. 
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